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Description 

^/ 

Minl-proiiuiulin, its P^i^t^im^and uge_ 

5 whlcKT''" '° " ""^^^ -.nini-proinBulin. in 

Which the unshortened b chain is only bonded to the A 
Chain via an arginine residue. Human insulin is access- 
reactlor ^^^"^ "^"^"P^^i'^^-lin without difficult chemical 

10 ^iiusTtT'r • Shortened C chain are known, 

acribed a proinsulin having a C chain shortened to six 

^e" " . ^'^^^'^^ application having 

the publication number (ep-a, 0,055,945, corresponding 

i5 Shortened to two amino acids. 

rri* ^■"""■> precursors having s shortened 

B ch,in ere dlscloseo 1„ which the c chain is either 

"T" '"^''^^^ ~ """.ned to one e.ino ecid. 

These precursor, ere converted into i«ture hu».n iisulin • 

i^i^r?'""''"" ".nspeptidstion using .n .-t;:::o::: 

?3/sr,°*T '^^ ^"""o" relates to hu-an Des- 

(32-65) proinsulin or nini-proinsulin o£ the formula I 

B(l-30)-Arg-A(l-2l) (ij, 

S in Which B(l-30, and A(l-2l, denote the B and A chain of 
hu«n insulin. This compound is not o^ly ^ H 

dis^lo TT r'""""" which has 

b^n di.cios«i in European Patent No., (bp-b, 0,132 769 

ctt I"i';s"i°: V " -hi 

certain insulin activity itself. 



The invention therefore also relates to the compound of 
the formula I for use as a pharmaceutical, in particular 
for the treatment of diabetes mellitus, and furthermore 
pharmaceuticals containing the compound of the formula 
I, and pharmaceuticals composed of a pharmacologically 
acceptable excipient and the compound of the formula I. 

The invention furthermore relates to the use of the 
compound of the formula I for the preparation of the 
mono-Arg insulin of the formula ii 

S - S 
I I 
A(l - 21) 

f f 

S S . 

I I 
B(l - 30) - Arg 

in which A (1-21) and B(l-30) have the meanings mentioned 
previously and the -S-S-bridges are arranged as in 
insulin, and of human insulin by enzymatic cleavage. The 
immediate conversion • of the compound of the formula I 
into insulin in a "one-pot reaction" is particularly 
advantageous . 

The invention furthermore relates to a process for the 
preparation of the coinpound of the formula I which 
comprises expressing a gene structure encoding for this 
compound in a host cell, preferably in a bacterium such 
as E. coli or in a yeast, in particular Saccharomyces 
cerevisiae, and, if the gene structure encodes for a 
fusion protein, liberating the compound of the formula I 
from this fusion protein. The invention relates in 
addition to DMA sequences which encode for the compound 
of the formula I, gene structures or plasmids which 
contain this DMA, and host cells, in particular bacteria 
such as E. coli or yeast cells, particularly yeasts of 
the strain Saccharomyces cerevisiae which contain such 
gene structures or plasmids. The invention furthermore 
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relates to fusion proteins which contain a compound of 
the formula 1, preferably fusion proteins in which the 
compound of the formula I is bonded via the bridging 
member 



- Met - He - Glu - Gly - Arg - 
to the "ballast component- of the fusion protein. 



Further preferred embodiments of the invention 
explained in more detail in the following. 



are 



The figures are used to explain the Examples, Fig. 1 (and 
its continuation in Pig. la and lb) showing the construe 
tion of the E. coli expression vectors PIKIO and pSW3 and 
Fig. 2 (and its continuation in Fig. 2a and 2b) showing 
that of the yeast expression vector pafB102 or pofB104. 
These vectors encode for mini -proinsulih. 

It has been found that the mini-proinsulin has the 
correct folding so that mono-Arg insulin is formed almost 
quantitatively after cleavage with trypsin, a surpris- 
ingly simple process for the preparation of mono-Arg 
insulin thus results. Human insulin can be prepared from 
this in a manner known per se. Mono-Arg insulin is 
furthermore used as an active compound in pharmaceuticals 
(EP-B 0,132,769). 

The expression vector pK50 was described in EP-A 
0,229,998 or in the corresponding AU-A 66,760/86. Mini- 
proinsulin can be prepared in a bacterium such as E. coli 
in the form of a fusion protein corresponding to this 
construction. 

The poorly soluble fusion protein can be concentrated by 
washing with neutral buffer solutions. The mini-pro- 
30 insulin is liberated by cyanogen halide cleavage 
(E. Gross and B. Wittkop, j. Am. Chem. Soc. £2 (1961) 
1510-1517). This still does not exist in the 



biologically active form, but consists of a non-uniform 
mixture having various inter- and intramolecular disul*. 
fide bridges, possibly also with other protein fragments. 
The S-sulfonate form of the molecule is prepared as a 
chemically uniform, relatively stable derivative (P.G. 
Katsoyannis et al.. Biochemistry £ (1967) 2635-2641). 
This derivative can be purified very easily by ion 
exchange chromatography and is an approved starting 
material for the folding into the native spatial struc- 
ture with the formation of the correct disulfide bridges 
(Y,C. Du et al., Sci. Sin. IS (1965) 229-236; H. P. 
Gattner et al., Hoppe-Seylers Z. physiol. Chem. 362 
(1981) 1943-1049; B.H. Prank et al. in: "Peptides: 
Synthesis-Structure-Punction", D.H. Rich and E. Gross, 
Publishers, (1981) 1043-1049). The success of this 
folding is safeguarded by HPLC analysis of the fragments 
resulting after cleavage with . S. aureus - protease V8 
(U. Grau, Diabetes 14 (1985) 1174-1180). 

The liberation of mono-Arg insulin or insulin by the 
action of trypsin or carboxypeptidase B or by enzymes 
having the same action (W. Kemmler et al., J, Biol. Chem. 
m (1971) 2780-2795) proceeds in a decidedly uncompli- 
cated manner, since it proves particularly advantageous 
for this purpose that the number of possible cleavage 
sites is reduced compared to normal proinsulin. Because 
of this, the cleavage is considerably more simple to 
control (with respect to the formation of side products 
as in the preparation of Des-B30 insulin or Z>esocta-B23- 
B30 insulin). Both mono-Arg insulin and also insulin can 
be isolated in a known manner in highly pure form by ion 
exchange chromatography. The formation of the insulin 
and the mono-Arg derivative, the course of the purifica- 
tion and the quality of the final product are checked 
using customary RP-HPLC methods (G. Seipke et al., Angew. 
Chem. as (1986) 530-548), 

Surprisingly, hardly any insulin Des-BSO is formed in the 
cleavage of mini-proinsulin in contrast to natural 



proinsulln. Since the latter can only be separated from 
insulin in a very difficult manner, the "two-pot reac-- 
tion" is preferred in the production of insulin from 
natural proinsulln, i.e. the principal products of the 
tryptic cleavage, insulin-Arg^^^-Arg*" and mono-Arg insulin 
are first separated from insulin Des->B30 via ion exchan- 
gers in a known manner and then cleaved to give human 
insulin by means of carboxypeptidase B (EP-B 0,195,691). 
In comparison, mini-proinsulin can be converted to human 
insulin in an ideal manner in a "one-pot reaction** 
simultaneously using trypsin and carboxypeptidase B or by 
means of enzymes having the same action. 

The expression of the compound of the formula I in yeast 
with subsequent secretion is particularly advantageous, 
since the correctly folded proinsulln derivative can be 
isolated directly. Teasts are used as host systems, as 
are shown, for example, in EP-A 0,248,227, thus for 
example B. Pichia pastoris, Hansenula polymorphis, 
Schizosaccharomyces pombe or, preferably Saccharomyces 
cerevisiae. 

Vectors for expression in yeasts are known in large 
numbers . The preparation of the insulin derivative 
according to the invention is described in the following 
with the aid of the yeast a-factor system which, however, 
is only to be taken as an example, since other expression 
systems can also be employed in a manner known per se. 

The structure of the yeast pheromone gene MFa is known 
from the publication Kurjan and Herskovitz, Cell 30 
(1982) 933-943, where the possibility of the expression 
of other genes and the secretion of the gene products are 
also discussed. Regarding this, reference can also be 
made to Brake et al., Proc. Natl. Acad. Sci. USA 81 
(1984), 4642-4646. 



Alternatively, a yeast "Killertoxin" system can be used, 
or secretion via the acidic phosphatase or invert ase 
system can be utilized. 

As yeast vectors, so-called "shuttle" vectors are advan- 
tageously used which have a bacterial plasmid and a yeast 
plasmid replication origin and also a gene or genes for 
selection in both host systems. In addition, such 
vectors contain the promoter sequences necessary for the 
expression of foreign genes and, if appropriate, a 
terminator sequence to improve the yield so that the 
heterologous gene - expediently fused to secretory 
signals - is arranged between promoter and terminator. 
Such vectors are described, for example, in US-A 
4,766,073. 

The genetic code is, as is known, "degenerate", i.e. a 
single nucleotide sequence encodes only for two amino 
acids, while the residual 18 encodable amino acids are 
allocated two to six triplets. For the synthesis of the 
gene for mini-proinsulin, there is thus a large variety 
of codon- combinations, to be chosen. It has now been 
found that the DNA sequence I encoding for mini-pro- 
insulin (which is reproduced in the Appendix in the form 
of the two gene fragments IK I (Table 1) and IK II (Table 
2)) is particularly advantageous, since it is optimized 
to codon use of both yeast and E. coli. 

A "protruding" DNA sequence corresponding to the restrict 
tion endonuclease Kpnl is situated at the 5' end of the 
encoding strand of the DNA sequence 1. in comparison, 
the single-stranded sequence corresponding to the restri- 
ction enzyme Hindi 1 1 is protruding at the 3' end of the 
encoding strand. These two different recognition sequen- 
ces ensure the insertion of the DNA sequence I into 
plasmids in the desired orientation. Two translation/- 
termination codons (stop codons) follow triplet No. 65 
for asparagine. in the encoding sequence. An internal 
unique cleavage site for the restriction enzyme Pst I 



(codon 41/42) makes possible the sub-cloning of two gene 
fragments which can be incorporated in well investigated 
plasmids such as pUC18 or derivatives of these plasmids. 



Additionally, a number of further unique recognition 
sequences for restriction enzymes can be incorporated 
within the structure gene, which on the one hand provide 
access to partial sequences of proinsulin and on the 
other hand permit mutations to be carried out: 

Restriction enzyme Cut after nucleotide no. 

(encoding strand) 



ACCi 


201 


Drain 


46 


FnuDII 


107 


Hgal 


105 


Hindi I I 


213 


Hlnfl 


17 


Hpal 


22 


HphI 


76 


Mael 


155 


Maelll 


191 


MboII 


89 


Hlul 


106 


Ncol 


207 


Nlalll 


208 


PvuII 


175 


Sail 


201 


Spel 


154 


Styl 


207 


TaqI 


202 



The DHA sequence I was modified from the natural seqn^ence 
at essential points. In this way, the insertion of the 
numerous unique cleavage sites for restriction enzymes 
was possible. 



The DNA sequence I can be constructed from a total of 6 
oligonucl otides having a chain length of 47 to 96 



nucleotide units. For this purpose, the procedure is as 
described in the following. 

The gene fragment IK I (Table 1) can be constructed from 
4 oligonucleotides having a chain length of 47 to 74 
units by first synthesizing these chemically and then 
linking them enzymatically via "sticky ends*' of 3 nucleo- 
tides. The sticky ends correspond to those of the 
restriction enzyme Dralll, which is advantageous for 
later modifications. 

The gene fragment IK II (Table 2) can be obtained from 
two chemically synthesized oligonucleotides having a 
length of 88 and 96 nucleotide units. 

Example 1 

a) Chemical synthesis of a single-stranded oligo- 
nucleotide 

The synthesis of the DNA building blocks is explained 
using oligonucleotide No. 4 (Table 1) as an example. For 
the solid phase synthesis, the nucleoside at the 3' end, 
i.e. adenine (nucleotide No. 125) in the present case, is 
used bonded covalently to a support via the 3'-hydroxyl 
function. The support material is CFG ("controlled pore 
glass**) functionalized with long-chain aminoalkyl radi- 
cals. 

In the following synthetic steps, the base component is 
employed as ^-cyanoethyl N, N ' -dialkyl-5 ' -0-dimethoxy- 
tritylnucleoside-3'-phosphoramidite, where the adenine is 
present as the N^-benzoyl compound, the cytosine as the 
N*-benzoyl compound, the guanine as the N^-isobutyl 
compound and the thymine without a protective group. 



25 mg of the polymeric support, which contains 0.2 /imol 
of 5 ' -0-dimethoxytrityl-N*-benzoyl-2 ' -desoxyadenosine 



bonded, are successively treated with the following 
agents : 

A) acetonitrile 

B) 3% trichloroacetic acid in dichloromethane 

C) acetonitrile 

D) 5 /Amol of the appropriate nucleoside-3 '-0-phosphite 
and 25 /imol of tetrazole in 0.15 ml of anhydrous 
acetonitrile 

E) acetonitrile 

F) 20% acetic anhydride in tetrahydrofuran containing 40% 
lutidine and 10% dimethylaminopyridine 

G) acetonitrile 

H) 3% iodine in lutidine /water /tetrahydrofuran in a 
volume ratio of 5:4:1 

In this connection, "phosphite- is taken to mean ^-cyano- 
ethyl 2'-desoxyribose-3'-monophosphite, the third valency 
being satisfied by a diisopropylamino radical. The 
yields of the individual synthesis steps can in each case 
be determined spectophotometrically by the detritylation 
reaction B) by measuring the absorption of the dimeth- 
oxytrityl cation at the wavelength of 496 nm. 

After synthesis has been concluded, the cleavage of the 
dimethoxytrityl group is carried out as described in A) 
to C). The oligonucleotide is cleaved from the support 
by treatment with ammonia and the ^-cyanoethyl groups are 
eliminated at the same time. The amino protective groups 
of the bases are cleaved quantitatively by treatment of 
the oligomers with concentrated ammonia at 50°C for 16 
hours. The crude product thus obtained is purified by 
polyacrylamide gel electrophoresis. 

The oligonucleotides 1 -3 (Table 1), 5 and 6 (Table 2) 
are prepared in an analagous manner. 



b) Enzymatic linkage o£ the single-stranded oligo- 
nucleotides 

For enzymatic phosphorylation of the oligonucleotides at 
the 5' terminus, each 1 ^imol of the oligonucleotides 1 
and 4 Is treated for 30 minutes at 37**C with four units 
of T4 polynucleotide kinase In 20 ^il of 50 mM trls-HCl 
buffer (pH 7.6), 10 mM of magnesium chloride and 10 mM of 
dlthlothreltol (DTT) • The enzyme Is Inactivated by 
heating to 95**C for 5 minutes. The oligonucleotides 2 
and 3, which form the "protruding" single-stranded 
sequences, are not phosphorylated* This prevents the 
formation of larger gene fragments In the subsequent 
ligation. 

The oligonucleotides 1 to 4 are llgated as follows: each 
1 /imol of the oligonucleotides 1 and 2 or 3 euid 4 are 
hybridized In pairs by dissolving these In each case In 
20 m1 of 50 mM trls-HCl buffer (pH 7.6), 10 mM of mag- 
nesium chloride and 10 mM of DTT, heating this solution 
to 95**C for 5 minutes and cooling to room temperature 
within 2 hours.. For this purpose,: the. oligonucleotides 
1 and 4 are employed In the form of their 5 ' -phosphates . 
For further linking of the blhellcal DNA fragments 
formed, the solutions of these are combined, warmed to 
60**C for 15 minutes and cooled to room temperature. 2 /il 
of 0.1 M DDT, 16 ^1 of 2.5 mM adenosine triphosphate (pH 
7) and 1 ;il of T4 DNA llgase (400 units) are then added 
and the mixture Is Incubated at 22**C for 16 hours. 

The purification of the gene fragments thus obtained 
(Tables 1 and 2) Is carried out by gel electrophoresis on 
a 10% strength polyacrylamlde gel (without addition of 
urea, 40 x 20 x 0.1 cm), *X174 DNA (BRL) cleaved using 
Hlnfl, or PBR322, cut using Haelll, being used as a 
labeling substance. 



Example 2 



a) Cloning of the synthesized DNA fragments 

The commercial plasmid pUC19 is opened using the restric- 
tion enzymes Kpnl and PstI and the large fragment (1) is 
separated through a 0.8% strength "Seaplaque" gel. This 
fragment is reacted with T4 DNA ligase using the DNA (2) 
synthesized according to Table 1 and the ligation mixture 
is incubated with competent E. coli 79/02 cells. The 
transformation mixture is plated out on IPTG/Xgal plates 
which contain 20 mg/1 of ampicillin. The plasmid DNA is 
isolated from the white colonies and characterized by 
restriction and DNA sequence analysis. The desired 
plasmids are called pIKl. 

Accordingly, the DNA (5) according to Table 2 is ligated 
into pUC19 which has been opened using Psti and' Hindi II 
(4), The plasmid plK2 (6) is obtained. 

b) Construction of the mini-proinsulin gene 

The DNA sequences (2) and (5) according to Table 1 and 2 
are reisolated from the plasmids pIKl (3) and pIK2 (6) 
and ligated with pUC19, which has been opened using Kpnl 
and Hindlll (7). The plasmid plK3 (8) is thus obtained 
which encodes for a modified human insulin sequence. 

The plasmid pIK3 (8) is opened using Mlul and Spel and 
the large fragment (9) is isolated. This is ligated with 
the DNA sequence (10) 

B30 Al A2 A3 A4 A5 A6 A7 A8 A9 

(Thr) (Arg) Gly lie Val Glu Gin Cys Cys (Thr) (Ser) 

5' CG CGT GGT ATC GTT GAA CAA TGT TGT A 

3' A CCA TAG CAA CTT GTT ACA ACA TGA TC 

(MLUI) 



(10) 
3' 
5' 

(SJpel) 
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which supplements the last codon of the B chain (B30) by 
one arginine codon and replac s the excised codon for the 
first 7 amino acids of the A chain and supplements the 
codon for the amino acids 8 and 9 of this chain. 

5 The plasmid pIK4 (11) is thus obtained, which encodes for 
the human mini-proinsulin according to the invention* 

c) Expression vectors for mini-proinsulin 

pIK I: 

The plasmid pK50 (12) known from EP-A 0,229,998 (Example 
10 3 therein; Figure 3 (33)) is cleaved using EcoRI and 

Hindlll, Both fragments (13) and (14) are isolated. The 
small fragment (14) containing the IL-2 partial sequence 
is subsequently, cleaved with Mlul .and the IL-2 partial 
sequence (15) is isolated. 

15 The plasmid pIK4 (11) is cleaved using EcoRI and Hpal and 

the large fragment (16) is isolated. This is now ligated 
with the IL-2 partial sequence (15) and the synthetic DNA 
(17) 

B^ 

20 Met He Glu Gly Arg Phe Val 

5' CG C6T AT6 ATT GAG GGC CGT TTC GTT 3' (17) 
3' A TAG TAA CTC CCG GCA AAG CAA 5' 

(Mlul) (Hpal) 

the plasmid plK8 (18) being obtained. This encodes for 
25 a fusion protein in which a bridging member Met<-Ile-61u- 
Gly-Arg and then the amino acid sequence of the mini- 
proinsulin follow the first 38 amino acids of the IL-2. 

The EcoRI-Hindlll fragment which encodes for the fusion 
protein mentioned is excised from the plasmid pIK8 (18). 
30 This fragment is ligated with the large fragment (13) 
which was obtained in the cleavage of pK50. The dxpres- 
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eion vector pIKlO (20) which encodes for the previously 
characterized fusion protein is thus obtained. 

pSW3: 

If the Ndel-BstEII segment which includes the "bom site" 
is removed from the vector pIKlO (20), a vector is 
obtained which is present in the cell in a relatively 
high copy number and - on account of the missing "bom 
site" - can no longer be mobilized by conjugative plas- 
mids • 

For this purpose, the vector pIKlO (20) is cleaved using 
BstEII and Ndel, which is precipitated using ethanol, 
tranferred in DNA polymerase buffer and subjected to a 
Klenow polymerase reaction* The truncated DNA fragments 
thus formed are separated by gel, electrophoresis and the 
larger fragment (21) is isolated. The vector pSW3 (22) 
is obtained by ligation. After transformation of com- 
petent £• coli-Mcl061 cells and amplif icration, the 
plasmid pSW3 (22) is isolated and characterized. 

Example 3t Expression in the strain E. coli W3110 

An overnight culture of E. coli cells which contain the 
plasmid pIKlO (20) or pSW3 (22) is diluted in a ratio of 
about 1:100 using LB medium (J.H. Miller, Experiments in 
Molecular Genetics, Cold Spring Harbor Laboratory, 1972) 
which contains 50 ^g/ml of ampicillin and the growth is 
followed by means of OD measurement. At OD = 0.5, the 
culture is adjusted to 1 mM IPT6 and the bacteria are 
centrifuged off after 150 to 180 minutes. The bacteria 
are boiled for 5 minutes in a buffer mixture (7 M urea, 
0.1% SDS, 0.1 M sodium phosphate, pH 7.0) and samples are 
applied to an SDS gel electrophoresis plate. After 
analysis by gel electrophoresis, an additional band is 
observed in the region from about 10 Kd, which corres- 
ponds to the fusion protein expected. This band reacts 
with antibodies directed against insulin in the "Western 
Blot" experiment. If the ceils are disintegrated under 
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pressure and the debris is centrifuged off, the fusion 
protein is found in the sediment in addition to other 
insoluble cell constituents « 

The induction conditions indicated apply to shaken 
5 cultures ; with larger fermentations the choice of other 

media and conditions, for example in order to obtain 
changed O.D. values, is expedient. 

Example 4: 

a) Preparation of mono-Arg insulin 

10 40 g of the fusion protein concentrated by centrifugation 
and washing with phosphate buffer (pH 7) or water (dry 
sxibstance content about 25%) are dissolved in 75, ml of 
98-100% strength phosphoric acid and 5 g of BrCN are 
added. After reacting for 6 hours at room temperature, 

15 2 1 of water are added to the mixture and it is freeze- 
dr ied • 

The fragment mixture (10 g) is dissolved in 1 1 of buffer 
solution (8 M urea, 0.2 M tris-HCl (pH 8.5)), warmed to 
SO'^C and 10 g of sodium sulfite and 2.5 g of sodium 

20 tetrathionate are added. After 90 minutes at 30°C, 3 1 
of cold water are added and the pH is adjusted to 7.0. 
The resulting precipitate is centrifuged off. The hexa- 
S'Sulfonate of the mini-proinsulin is precipiated from 
the supernatant by adjusting the pH to 3.5. The mixture 

25 is centrifuged after incubating for 15 hours at +4'*C. 

The precipitate is washed with 200 ml of water and 
freeze-dried. 4.8 g of a substance mixture in which a 
mini-proinsulin content of 900 mg is determined by RP- 
HPLC are obtained. The concentration of the S sulfonate 

30 is carried out in two steps: 

1. Anion exchange chroma trography through a 5 x 60 cm 
column containing •Practogel TSK DEAE 650 M in 3 M 
urea; 0.05 M tris-HCl (pH 8.3). The elution is 



performed using a gradient of 0.05-0.5 M NaCl (each 
6 1). After analysis of the eluate by isoelectric 
focusing, the product is precipitated from the com- 
bined fractions by diluting to 1 M urea and adjusting 
the pH to 3.5. 

2 • Removal of high and low molecular weight impurities by 
gel filtration through •sephacryl S200 in 3 M urea; 
0.05 M tris-HCl; 0.05 M NaCl (pH 8.3). Analysis of 
the fractions and isolation of the product are carried 
out as in the preceding step. The precipitate is 
washed with 20 ml of water and freeze-dried. 1.10 g 
of product cocentrated to 69% purity are obtained. 

For folding and disulfide bridge formation, the S sul- 
fonate is dissolved in 50 ml of 8 M urea; 0.02 H tris-HCl 
at pH 0.6. After addition of a few drops of octanol, 
purified nitrogen is passed into the mixture for 15 
minutes. Complete reduction is carried out in the course 
of 1 hour at room temperature by addition of 1.1 ml (16 
mMol) of 2-mercaptoethanol. The solution is applied to 
a ^Sephadex G25 column <5 x 60 .cm) and eluted using 0.05 M 
glycine/NaOH (pH 10.6). The protein fraction in 300 ml 
of the glycine buffer is kept for 2 days at 4**C after 
checking and, if necessary, correction of the pH value 
(10.6). The solution is then adjusted to a pH of 6.8 and 
the solution is incubated at room temperature for 4 hours 
with 1 mg (3.5 U) of trypsin (Merck, treated with L-l-p- 
t08ylamino-2-phenylethylchloromethyl ketone (TPCK) . The 
pH is then adjusted to 3.5, 1 mg of soya bean trypsin 
inhibitor (Sigma) and 3 ml 10% ZnClj are added and the 
solution is readjusted again to pH 6.8. The resulting 
precipitate is separated by centrifugation. It contains 
predominantly mono-Arg insulin which is purified by ion 
exchange chromatography on S-Sepharose* (2.5 x 40 cm) in 
a buffer composed of 50 mM lactic acid and 30% isoprop- 
anol (pH 3.5). Elution is carried out by means of a 
gradient of 0.05-0.50 M of NaCl (each 1 1). The elutate 
is analyzed by HPLC; the mono-Arg insulin is precipitated 
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from the product-containing fractions after 1:1 dilution 
with H2O by adding 10 ml of 10% ZnClj per 1 1 and adjust- 
ing the pH to 6,8. The precipitate separated by centri- 
fugation is crystallized at pH 6 from a buffer composed 
of 1 g/1 of phenol, 10.5 g/1 of citric acid and 200 mg/1 
of ZnCla- 390 mg of mono-Arg insulin in over 90% purity 
are obtained after freeze*drying the crystals washed with 
some water. 

Example 5x 

Preparation of insulin 

200 mg of mono-Arg insulin (see Example 4) are dissolved 
in 100 ml of 0.05 M tris-HCl YpH 8.5) . 1 U (about 4 /ig) 
of carboxypeptidase B is then added and the solution is 
stirred slowly at room temperature. After 3 hours, the 
human insulin is crystallized by acidifying to pH 3.5 and 
adding 1 ml of 10% ZnClg at pH 5.5. 200 mg of crystalline 
insulin having a purity of more than 85% are obtained. 
This material is subjected to purification by ion exchan- 
ge chromatography -on a column containing Fractogel TSK 
DEAE 650 M (2.5 x 40 cm) in 0.1% ^utensol ON 100 (BASF 
AG; oxethylate of a linear saturated fatty alcohol of 
essentially 12 carbon atoms); 0.05 M tris-HCl (pH 8.3), 
the elution being carried out using a gradient of 0-0.4 
M NaCl (each 11). The insulin is crystallized at pH 5.5 
from the product-containing fractions identified by means 
of HPLC after addition of 10 ml of 10% ZnClj and 1 ml of 
10% citric acid* After slowly stirring overnight, the 
mixture is centrifuged and the sediment obtained is 
recrystalized at pH 5.5 from 20 ml of a buffer composed 
of 5 g/1 of citric acid, 125 ml/1 of acetone and 200 mg/1 
of ZnClj. 160 mg of insulin having a purity of more than 
95% are obtained. 
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Example 6^ 

Preparation <xf insulin from mono-Arg insulin by combin d 
use of trypsinNand carboxypeptidase B 

5 mg of mono-Arg iksulin are dissolved in 20 ml of 0*1 M 
5 of tris-HCl (pH 8.o\ and the solution is warmed to 30°C. 

At the same time, 2 .Soil of trypsin solution (1 U/ml) and 
150 /il of carboxypep4/^^^e B solution (1 U/ml) are added. 
After 3 hours, the solution is adjusted to pH 3.5 and 2.5 
m1 of trypsin inhibitor \olution (1 U/ml) and 200 /il of 
10 10% strength ZnCls solution, are added. The human insulin 

is precipitated by adjusting to pH 6.8, centrifuged off 
and crystallized as in ExaWple 5. The crystallized 
insulin has a purity > 95%. \ 

(m Example^! Construction of a yeast expression vector 

15 The DNA sequence (23) (Table 3) is first synthesized by 

the phosphite method. This DNA sequence (23) encodes for 
the amino acids 49 to 80 of the MFo precursor protein and 
corresponds essentially to the natural DNA sequence. 

The DNA sequence (23) is first used as a probe for the 
20 isolation of the gene for the a factor and is labeled 

with for this purpose. With the aid of this probe, 
the gene is isolated from a genomic Agtll yeast gene bank 
(as are meanwhile commercially available and obtainable, 
for example, at Clontech Laboratories Inc., 4055 Fabian 
25 Way, Palo Alto, CA94303). To this end, Agtll phages 
which carry the a factor gene are identified in a plaque 
hybridization experiment. Phages from plaques identified 
as positive are isolated, replicated and the DNA ob- 
tained. This is cleaved using EcoRI and analyzed on a 
30 0.8% strength agarose gel. The membrane is hybridized 

against the "P-labeled DNA sequence (23) by a "Southern 
transfer- experiment. Phage DNA which contains a frag- 
ment of about 1.75 kb (24) which is hybridized against 
the DNA sequence (23) is again cleaved with the enzyme 
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and the corresponding fragment (24) is Isolated. The 
vector pUC 19 is opened using EcoRl (25) and reacted with 
the 1.75 kb fragment (24) using T4 ligase. The cloning 
vector (26) is obtained. 

The strain E. coli 79/02 is transformed using the liga- 
tion mixture. White colonies are isolated, the plasmid 
DNA is obtained from these and plasmids (26) which 
contain the 1.75 kb EcoRI fragment, are identified. 

The natural DNA sequence of the precursor protein for MFa 
contains a PstI cleavage site in the region of amino 
acids 8 to 10 and a TaqI cleavage site in the region of 
amino acids 48/49. From the isolated plasmid DNA (26), 
the fragment (27) which encodes for the amino acids 9 to 
48 of the MFo precursor sequence. is isolated by reaction 
with PstI and Taql. The vector pUC18 is opened using 
PstI and Kpnl and reacted with the Pstl-TaqI fragment 
(27) and with the synthetic DNA sequence (23) with the 
aid of T4 ligase. E. coli 79/02 is transformed using the 
ligation mixture. The transformation mixture is plated 
out on to IPT6-Xgal-Ap-plates. White colonies are 
isolated and the plasmid DNA of this clone is charac- 
terized by restriction analysis. The cloning vector (29) 
which encodes for the amino ac ids 8 to 80 of the MFo 
precursor sequence is thus obtained. 

The encoding sequence (30) mentioned is excised from the 
cloning vector (29) by reaction with PstI and Kpnl and 
incorporated in the ligation described in the following. 
For this purpose, the cloning vector (26) is reacted with 
EcoRI and partially with PstI and the fragment (31) 
including the encoding sequence for the first 8 amino 
acids of the MFa precursor sequence is isolated. Fur- 
thermore, the vector pUC19 is opened using EcoRI and Kpnl 
(32) and ligated with the two fragments (30) and (31) 
described, the cloning vector (33) being formed. This 
^codes for the total precursor sequence of MFa up to 
amino acid 80. 



The cloning vector (33) Is opened using Kpnl and Hindlli 
and the large fragment (34) is isolated • This is ligated 
using the KpnI-Hindlll fragment (35) from the plasmid 
(11) which encodes for the mini-proinsulin. The plasmid 
pIK20 (36), the structure of which is confinned by 
restriction analysis, is thus obtained. 

The plasmid Yepl3 (Broach et al.. Gene 8 (1979) 121) is 
opened using BamHI and the protruding ends are filled in 
with Klenow polymerase (38). The DNA is precipitated 
using ethanol and treated with alkaline bovine phospha- 
tase. 

The fragment encoding for the insulin derivative and the 
precusor sequence of MFa is excised from the cloning 
vector (36) using Hindlll and EcoRl and the protruding 
ends are filled in as described (37). 

The two truncated DNA sequences (37) and (38) are ligated 
with one another, the plasmids pafB102 (39) and pafB104 
(40) being formed. These two plasmids differ only in the 
orientation of the inserted fragment. 

As described in EP-A 0,171,024, a terminator can be 
inserted behind the inserted sequence (Figures 4 to 6 of 
EP-A 0,171,024). For this purpose, the Ncol and/or the 
BamHI cleavage sites are suitable. 

After amplification of the plasmid DNA in E. coli MM294, 
the plasmid pafB102 (39) is transformed in the leucine- 
requiring yeast strain Y79 (ck,trpl,leu2-l) (Cantrell et 
al., Proc. Acad. Natl. Sci. USA 82 (1985) 6250) and DH6-6 
(a/o leu2-3,112t tura3Vleu2i«lys2*, trplVtrpl", his3-ll, 
15/his3-ll, 15, ura3*/ura3", lys2Vly82^, arg4-17/arg4*, 
adel/adel^) (Maya Hanna, Dept. Mol. Biol, Massachusetts 
General Hospital, Boston, USA) by the lithium method o£ 
Ito, H. et al., J. Bacterid., 153 (1983) 163. Colonies 
which can grow on selective medium without addition of 
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leucine, are isolated and combined. Yeast minimum mediiua 
is inoculated with the individual colonies and incubated 
at 28**C for 24 hours. The cells are centrifuged off and 
the supernatant is tested for insulin activity in an RIA 
test. The plasmid DNA is reisolated from yeast clones 
whose supernatant shows insulin activity, and charac- 
terized by restriction analysis. The transformed yeast 
strains are employed for the following expression. 

n 

Example Zi Expression in yeast 

10 ml of yeast complete medium is inoculated with cells 
which have been removed from a fresh overnight culture of 
a strain of selective medium obtained according to 
Example 7, in such a way that an optical density 
ODgoo = 0.1 is achieved. The culture is shaken at 28'*C for 
8 hours, after which 90 ml of fresh medium are added. 
The culture is then shaken for a further 20 hours. The 
cells are centrifuged off and the insulin concentration 
is determined in the supernatant. The conditions are 
modified for a larger fermentation, for example fresh 
medium can be added. continuously. 

Example J^i Purification of mono-Arg insulin from yeast 
supernatant 

The fermentation supernatant is added through an adsorp- 
tion column containing a porous adsorber resin composed 
of a copolymer of styrene and dlvlnylbenzene (•Diaion 
HP 20). The column was previously equilibrated with a 
20-50 mH acetate buffer (pH 5). After washing with tris 
buffer (pH 8), an Isopropanol gradient (0-50%) is applied 
using a 10-fold column volume. Insulin-containing 
fractions are adjusted to pH 6, ^TREX CELLUFINE AM 
(Amicon) is added, and the mixture is stirred and fil- 
tered off with suction and washed with 50 mM acetate 
buffer (pH 6). The wash fraction and the main fraction 
are combined, adjusted to pH 3.5 using lactic acid and 
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added through an S--SEPHAROSE column which has been 
equilibrated using 50 mM lactic acid (pH 5)/30% Isopropa- 
nol • 

The elution is carried out by means of a 0-0 « 6 M NaCl 
gradient. Mini-proinsulin elutes in the range 0*25-0.3 
M. 

The proinsulin-^containing fractions are concentrated to 
1/4 of the volume and added through a column containing 
Biogel PIO (Bio*Rad), equilibrated in 6% acetic acid 
(pH 2). The insulin-containing eluate is lyophilised and 
purified through a preparative "reversed phase" HPLC step 
(RP18 material, 0.1% TFA, acetonitrile gradient 20-40%). 
After subsequent freeze-drying, the lyophilisate is 
dissolved in tris buffer (pH 6.8) and incubated for 3 to 
5 hours at room temperature with 4 units of trypsin per 
gramme of mono-Arg proinsulin. The course of the reac- 
tion is checked by "reversed-phase" analysis. It shows 
that mono-Arg insulin is formed nearly quantitatively. 
At the end of the reaction, the pH is adjusted to 3.5 and 
the reaction is completed by . addition of an equivalent 
amount of trypsin inhibitor. The zinc chloride con- 
centration is then adjusted to 0.21 g/1 and the pH to 
6.8. A flocculent precipitate is obtained which is 
dissolved in lactic acid buffer. The components are 
separated from one another by means of S-SEPHAROSE 
chromatography. Fractions which contain mono-Arg insulin 
are combined and mixed with water in the ratio lil. 
10 ml of 10% ZnCl^ per 1 1 are then added to the solution. 
Mono-Arg insulin then precipitates at pH 6.8 and is 
recrystallized in a known (for insulin) manner. 

Example jrCT: Preparation of human insulin 

A 

The mono-Arg insulin prepared according to Example 9 is 
used as a starting substance for the carboxypeptidase 
B cleavage. To this end, the insulin derivative is 
dissolved in tris buffer (pH 8.5) and 5 units of 
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carboxypeptidase per gramme of mono-Arg insulin are 
added. The reaction is carried out over the course of 3 
hours with slow stirring at room temperature. The 
product is then precipitated with ZnClj as described in 
Example 9. Human insulin is then purified in a known 
manner (DE-B 2,629,568). 
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Table 1< Gene fragment IK I (2) 



10 



Phe 
20 



30 



40 



< - - 2 

CT TTG GAC AAG AGA TTC GTT AAC CAA CAC TTG TGT GGT TCT CAC 
CAT GGA AAC CTG TTC TCT AAG CAA TTG GTT GTG AAC AC A CCA AGA GTG 

< 1 - --> 

(Kpnl) Hpal 



50 60 70 80 90 . . 

"> <---- - --4- 

TTG GTG GAA GCG TTG TAG TTG GTT TGT GGT GAG OCT GGT TTC TTC 
AAC CAC CTT CGC AAC ATG AAC CAA ACA CCA GTC GCA CCA AAG AAG 
< 3 

b30 

Thr Arg Lys Gly Ser Leu 
100 110 120 



TAG ACT CCA AAG ACG CGT AAG GGT TCT CTG CA 
ATG TGA GGT TTC TGC GCA TTC CCA AGA G 
. > 

Mlul (Pstl) 



Table 2t Gene fragment IK II (5) 

Gin Lys Arg Gly 

130 140 150 160 

< - 6 

G AAG CGT GGT ATC GTT GAA CAA TGT TGT ACT AGT ATC TOT TCT 
AC GTC TTC GCA CCA TAG CAA CTT GTT ACA ACA TGA TCA TAG ACA AGA 

< - 5 

(PstI) Spel 



a21 

Asn 

170 180 190 200 210 

> 

TTG TAG CAG CTG GAA AAC TAG TGT AAC TGA TAG TCG ACC CAT GGA 

AAC ATG GTC GAC CTT TTG ATG ACA TTG ACT ATC AGC TGG GTA CCT TCG A 



(Hindlll) 
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Table 3: DNA sequence (23) 

50 55 
5* C GAT GTT GCT GTT TTG CCA TTC TCC 
3' TA CAA CCA CAA AAC GOT AAG AGG 

(Taqi) 

60 65 
AAC AGT ACT AAT AAC GGT TTA TTG TTC 
TTG TCA TGA TTA TTG CCA AAT AAC AAG 

70 

ATT AAT ACT ACT ATT GCT AGC ATT OCT 
TAA TTA TGA TGA TAA CGA TCG TAA CGA 

75 80 
GCT AAA GAA GAA GGG GTA C 3' 
CGA TTT CTT CTT CCC 5' 

(Kpnl) 



